May 26, 2020
Python…
Just like the real python, it is huge in size(To study and understand). But ofcourse nothing is bigger than thirst of understanding the world of Datascience. Come on! Lets start to learn ..slowly we will deep dive and we will swim around the ocean of DataScience…
Python was introduced as a General Purpose language. It was initially used for Frontend, Backend, WebDevelopment. But Now it is mainly used for DataAnalytics due to the library specially built for this Analytics and Visualization.
Python can handle large amount of Data, Where R becomes slower to handle because it runs on RAM. But python runs in HardDisk where we dont need to worry about storage.
Link for downloding Python- https://www.python.org/downloads/ I installed python for MAC, you can install based on your OS. Since i installed python, i installed Jupyter Notebook, which is a web based interactive platform for running python code.
cd /usr/local/bin (python installation dir in my sys)
pip install jupyterlab
pip install notebook
jupyter notebook
Below are Python Libraries. Check the libraries and its purposes. When you know the purpose, it would be easy when we include them during programming.
Pandas is a dataframe in python. It is used for structured data operations and manipulations. Also used for DataPreprocessing and Modelling. Helps to have your input in structured way.
Scientific Python- Built on top of NumPy. It can be used for optimization and Advanced fourier transform.
Used for Symbolic Computations
Numerical Python-Deals with Numerical Values, Contains basic Linear Algebra function, Fourier Transform, Advanced Random Number Capability, Integration tools with low level languages like c and c++.
Designed for Visualization. But only for static images.
Helps to develop ML Models. Built on top of NumPy, SciPy, MatplotLib. It contains ML Algorithm.
Used to access data from different DB.
Helps to start something from start to end. Helps to scrap the social websites from start and dig through it.
Helps to generate interactive plots.
Helps to develelop interactive visualization.
Helps to devlop Statistical Models, and Tests(ANOVA, CHI-Square)
Used for IN/OUT operations and File Transfer operations.
Designed for Text Mining.
Used for scraping the websites. This is lower to Scrapy.
Used for accessing the web for IN and OUT operations
Hope it helps to some extent. We will start with python programming from our next.